Sentiment Classification on Polarity Reviews: An Empirical Study Using Rating-based Features
نویسندگان
چکیده
We present a new feature type named rating-based feature and evaluate the contribution of this feature to the task of document-level sentiment analysis. We achieve state-of-the-art results on two publicly available standard polarity movie datasets: on the dataset consisting of 2000 reviews produced by Pang and Lee (2004) we obtain an accuracy of 91.6% while it is 89.87% evaluated on the dataset of 50000 reviews created by Maas et al. (2011). We also get a performance at 93.24% on our own dataset consisting of 233600 movie reviews, and we aim to share this dataset for further research in sentiment polarity analysis task.
منابع مشابه
LABR: A Large Scale Arabic Book Reviews Dataset
We introduce LABR, the largest sentiment analysis dataset to-date for the Arabic language. It consists of over 63,000 book reviews, each rated on a scale of 1 to 5 stars. We investigate the properties of the the dataset, and present its statistics. We explore using the dataset for two tasks: sentiment polarity classification and rating classification. We provide standard splits of the dataset i...
متن کاملAKTSKI at SemEval-2016 Task 5: Aspect Based Sentiment Analysis for Consumer Reviews
This paper describes the polarity classification system designed for participation in SemEval2016 Task 5 ABSA. The aim is to determine the sentiment polarity expressed towards certain aspect within a consumer review. Our system is based on supervised learning using Support Vector Machine (SVM). We use standard features for basic classification model. On top this, we include rules to check prece...
متن کاملAn empirical study of sentence features for subjectivity and polarity classification
While a number of isolated studies have analysed how different sentence features are beneficial in Sentiment Analysis, a complete picture of their effectiveness is still lacking. In this paper we extend and combine the body of empirical evidence regarding sentence subjectivity classification and sentence polarity classification, and provide a comprehensive analysis of the relative importance of...
متن کاملFeature based Star Rating of Reviews: A Knowledge-Based Approach for Document Sentiment Classification
This paper presents a novel knowledge-based approach for star rating of reviews. It uses SentiWordNet and linguistic heuristics to determine sentiment orientation of sentences, which is used to assign a positive, negative and objective score to document to achieve 5-star rating of movie reviews. A method for generating ratings based on individual features is also presented. The experimental res...
متن کاملMulti Class Cross Domain Sentiment Classification Using Fuzzy Mapped Cluster
Nowadays, e-commerce is growing fast and customers make sure of quality based on product reviews. But the large no of product reviews makes it difficult to automatically classify them into different polarity classes (positive and negative). Most of the existing methods have come out for binary classification of customer reviews. The proposed work uses a fuzzy logic model for crossdomain sentime...
متن کامل